Search and retrieval of audiovisual content by integrating non-verbal multimodal, affective, and social descriptors
نویسنده
چکیده
One of the research challenges for future search engines concerns the integration of multimodal and cross-modal, nonverbal, full-body, affective, social, and enactive interaction in the process of search and retrieval of audiovisual content. The paper gives a short presentation of the three-year EU project I-SEARCH (EU 7FP ICT STREP), aiming at creating a novel unified framework for multimodal and cross-modal content indexing, sharing, search and retrieval of audiovisual content. A couple of scenarios developing multimodal paradigms of search and retirieval of audiovisual content are introduced and briefly discussed to explain in concrete terms some of the main research challenges that are addressed in I-SEARCH. Finally, the paper presents preliminary results on a specific research challenge: analysis of nonverbal expressive and social behaviour to extract useful information from users for the retrieval of audiovisual content.
منابع مشابه
Audiovisual integration of emotional signals in voice and face: An event-related fMRI study
In a natural environment, non-verbal emotional communication is multimodal (i.e. speech melody, facial expression) and multifaceted concerning the variety of expressed emotions. Understanding these communicative signals and integrating them into a common percept is paramount to successful social behaviour. While many previous studies have focused on the neurobiology of emotional communication i...
متن کاملSemantic Encoding and Markup of Georeferenced Documents in Polythematic Digital Libraries of Scientific Literature
The paper considers the principles and basic stages of decomposing georeferenced documents oriented to the problems of markup and semantic search. The paper justifies the necessity to develop a multimodal semiotic system and discusses verbal-visual knowledge representation in digital libraries. To represent knowledge, verbal-visual thesaurus is proposed. The thesaurus includes verbal, verbal-vi...
متن کاملUsing MPEG-7 for Automatic Annotation of Audiovisual Content in eLearning Digital Libraries
In this paper we present a prototype system to enrich audiovisual contents with annotations, which exploits existing technologies for automatic extraction of metadata (such as OCR, speech recognition, cut detection, visual descriptors, etc.). The prototype relies on a metadata model that unifies MPEG-7 and LOM descriptions to edit and enrich audiovisual contents, and it is based on MILOS, a gen...
متن کاملMultimedia search and retrieval using multimodal annotation propagation and indexing techniques
In this paper, a novel framework for multimodal search and retrieval of rich media objects is presented. The searchable items are media representations consisting of multiple modalities, such as 2D images, 3D objects and audio files, which share a common semantic concept. A manifold learning technique based on Laplacian Eigenmaps was appropriately modified in order to merge the low-level descri...
متن کاملPredicting Student Performance in Verbal Math Problems Based on Cognitive, Metacognitive, and Affective Factors
Predicting Student Performance in Verbal Math Problems Based on Cognitive, Metacognitive, and Affective Factors F. Karimi, Ph.D. A.R. Moraadi, Ph.D. P. Kadivar, Ph.D. R. Kormi Noori, Ph.D. To determine the predictive role of metacognitive, cognitive, and affective factors in solving verbal math problems, a cluster sample of 450 junior high school students was given ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010